CMV MATRICES: FIVE YEARS AFTER 



BARRY SIMON* 

Abstract. CMV matrices are the unitary analog of Jacobi ma- 
trices; we review their general theory. 



1. Introduction 

The Arnold Principle: If a notion bears a personal name, then this name is not the 
name of the inventor. 

The Berry Principle: The Arnold Principle is applicable to itself. 

- V. I. Arnold 
On Teaching Mathematics, 1997 |H] 
(Arnold says that Berry formulated these principles.) 

In 1848, Jacobi 42] initiated the study of quadratic forms 
J(x 1 , ...,x n ) = J2k=i ^k x k + 2 J2k=l a kXkXk+i, that is, essentially nxn 
matrices of the form 

fbx a x ... 0\ 
fli 62 ci2 ... 
a 2 63 ••• 



J 



\0 a n _i b n J 

and found that the eigenvalues of J were the zeros of the denominator 
of the continued fraction 

< L2 > 

61 - z T 



bo - z 



In the era of the birth of the spectral theorem, Toeplitz [7^] , Hellinger- 
Toeplitz i44j, and especially Stone [75] realized that Jacobi matrices 
were universal models of selfadjoint operators, A, with a cyclic vector, 
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To avoid technicalities, consider the case where A is bounded, 
and suppose initially that TC is infinite-dimensional. By cyclicity, 
{A k (p }^ =0 are linearly independent, so by applying Gram-Schmidt to 
(fo, Aip , A 2 ip , . . . , we get polynomials Pj(A) of degree exactly j with 
positive leading coefficients so that 

tPj=Pj(A)(p (1.3) 

are an orthonormal basis for 7i. By construction, 

ipj J_ (fo,A(p , . . . , A 3 ~ x cp 

so 

(<Pj,A<p k )=0 j>k + 2 (1.4) 
Because A is selfadjoint, we see (ipj, Aip k ) = also if j < k — 2. Thus, 
the matrix (ipj,A(pj.) has exactly the form (|1.1|) where a,j > (since 
Pj(A) has leading positive coefficient). 

Put differently, for all A, ip , there is a unitary U : H — > £ 2 (given 
by Fourier components in the (pj basis), so UAU" 1 has the form J and 
ipo = (1,0,0,...)*. The Jacobi parameters, {fln,&n}^i, are intrinsic, 
which shows there is exactly one J (with (fo = (1, 0,0,... )*) in the 
unitary equivalence class of (A,tp ). 

There is, of course, another way of describing unitary invariants for 
(A,<po): the spectral measure d\i defined by 

J x n dfi(x) = (vo,A n vo) (1.5) 

There is a direct link from d\x to the Jacobi parameters: the Pj(x) are 
orthonormal polynomials associated to d/u,, and the Jacobi parameters 
are associated to the three-term recursion relation obeyed by the p's: 

xpj(x) = a j+1 p j+1 + b j+ ipj(x) + ajPj^x) (1.6) 

(where p_i = 0). 

Here we are interested in the analog of these structures for unitary 
matrices. We begin by remarking that for a general normal opera- 
tor, N, the right form of cyclicity is that {N k (N*) e (p } ( ^ e=0 is total. 
Since A = A*, only {A k ip }'^ =0 enters. Since U* = Z7 _1 , for unitaries 
U k (U*Y = U k ~ e and the right notion of cyclicity is that {U k (p } < £ = _ 00 
is total. 

Some parts of the above four-fold equivalence: 

(1) unitary equivalence classes of (A, ip ) 

(2) spectral measures, that is, probability measures dfi on R with 
bounded support and infinite support 

(3) Jacobi parameters 

(4) Jacobi matrices 
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are immediate for the unitary case. Namely, (1) ^ (2) holds since there 
is a spectral theorem for unitaries, and so, a one-one correspondence 
between unitary equivalence classes of (U, <po) on infinite-dimensional 
spaces and probability measures on dB> (IB) = {z \ \z\ < 1}) with infinite 
support. 

More subtle is the analog of Jacobi parameters. Starting from such 
a probability measure on <9IB>, one can form the monic orthogonal poly- 
nomials § n (z) and find (see Szego [77j; see also Section 1.5 of [69 ) 
{a n }~ o G so 

z$ n (z) = + a n z n Wjl/f) (1.7) 

While Verblunsky [HI] defined the a n in a different (albeit equivalent) 
way, he proved a theorem (called Verblusnky's theorem in |J9]; see also 
jHEl) that says this map dfi — > {a n }™ =0 is one-one and onto all of D°°, 
so (l)-(3) for the unitary case have been well understood for sixty-five 
years. 

Surprisingly, (4) (i.e., the canonical matrix form for unitaries) is of 
much more recent vintage. The key paper by Cantero, Moral, and 
Velazquez ^2] was submitted in April 2001 — so we are witnessing five 
years of study in the area — it is reviewing these developments that 
is the main scope of this review article. Spectral theory of differential 
and difference operators has been an enduring theme of Des Evans' 
research and I am pleased to dedicate this review to him. 

There is an "obvious" matrix to try, namely, Gkl = (<Pk, Zl fe) with 
(fk — $k/\\®k\\ the orthonormal polynomials. This GGT matrix (as it 
is named in 69j; see Section ITU1 below) has two defects. First, {y^}^ 

is a basis if and only if X]^Lol a «l 2 = 00 > anc ^ ^ ^ ^ s no ^' &ke is n °t 
unitary and is not conjugate to multiplication by z (in that case, one 
can look at the minimal dilation of Q, which is discussed in IB^j). 
Second, it obeys (jl.4|) . but in general, (<pj, U*ifk) ^ for all j > k + 1, 
that is, Q is not of finite width measured from the diagonal. CMV ^2] 
has the following critical ideas: 

(a) The basis Xk obtained by orthonormalizing 1 

can be written in terms of (fe(z), tpe(l/z), and powers of z. 

(b) The matrix Cke = (Xk, z Xe) is unitary and five-diagonal. 

(c) C can be factorized into C = CM. where £ is a direct sum of 2 x 2 
unitary matrices and Ai the direct sum of a single lxl and 2x2 
matrices. 

It turns out that these key ideas appeared about ten years earlier 
in the numeric matrix literature (still, of course, much later than the 
1930's resolution of (l)-(3)). Intimately related to this history is what 
we will call the AGR factorization in Section^^ — the ability to write Q 
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in the case of n x n matrices as a product O . . . O n _iO n _i of matrices 
with a single 2x2 block placed in 1 and a finite matrix which is 
diagonal, differing from 1 in a single place (see Section ^2 for details). 

In 1986, Ammar, Gragg, and Reichel [5] found the AGR factoriza- 
tion for orthogonal matrices — here the ocj are real and the O(oj-) are 
reflections, so the AGR factorization can be viewed as an iteration of 
a Householder algorithm. In this paper, they also had a proof of the 
CM factorization for this case. This proof (a variant of which appears 
in Section ITU}) , which works in general to go from the AGR factoriza- 
tion of the GGT matrix to an CM factorization, was only given in the 
orthogonal case since they did not yet have the AGR factorization for 
general unitaries. 

In 1988 (published 1991), AGR extended the AGR factorization to 
the general unitary case and realized the connection to Szego recursion. 
While they could have proven an CM factorization from this using the 
method in [S], they did not and the general CM factorization only 
appeared in [TT| . 

In 1991, Bunse-Gerstner and Eisner ^1] found the CM factorization 
for a general finite unitary and noted it was a five-diagonal representa- 
tion. Watkins |H2] codified and cleaned up those results and emphasized 
the connection to OPUC and found a proof of Szego recursion from the 
CM factorization. Virtually all the main results from ^2] are already 
in Watkins [H21- 

We will continue to use the name CMV matrices, in part because the 
analytic revolution we discuss here was ushered in by their work and in 
part because the name has been used now in many, many publications. 

Here is a summary of the rest of this review. Section |21 presents 
the basics, essentially notation and (a)-(c) above. Section El discusses 
"other" CMV matrices. In particular, we consider two kinds of fi- 
nite variants. In the selfadjoint case, restricting the matrix by taking 
the first n rows and columns preserves selfadjointness but the analog 
for unitaries does not, and we have both the nonunitary cutoff CMV 
matrices obtained from the first n rows and columns and the unitary 
finite CMV matrices which are models of finite unitary matrices with 
a cyclic vector. Section 0] discusses CMV matrices for matrix-valued 
measures — something that is new here. Section discusses the effect 
on Verblunsky coefficients of rank one multiplication perturbations, 
and Section El the formula for the resolvent of the CMV matrices, the 
analog of well-known Green's function formulae for Jacobi matrices. 
Sections and El discuss perturbation results, and Section |H] a general 
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theorem on the essential spectrum of CMV matrices. Section El dis- 
cusses the AGR factorization discussed above as preparation for the 
Killip-Nenciu discussion of five-diagonal models for /^-distribution of 
eigenvalues, the subject of Section[TTJ Section H21 discusses the defocus- 
ing AL flows, which bear the same relation to CMV matrices as Toda 
flows do to Jacobi matrices. Finally, Section [13] discusses a natural re- 
duction of CMV matrices to a direct sum of two Jacobi matrices when 
all Verblunsky coefficients are real. 

We do not discuss the use of CMV matrices to compute the zeros 
of OPUC. These zeros are the eigenvalues of the cutoff CMV matrix. 
We note that this method of computing zeros was used in the recent 
paper of Martinez-Finkelshtein, McLaughlin, and Saff [57] . Numerical 
aspects of CMV matrices deserve further study. 

While this is primarily a review article, there are numerous new 
results, including: 

(1) an analysis of what matrices occur as cutoff CMV matrices (Sec- 
tion El) 

(2) an analysis following Watkins [B2] of the CM. factorization without 
recourse to Szego recursion (Section |SJ) 

(3) the basics of CMV matrices for matrix-valued measures (Section^) 

(4) a new proof of AGR factorization using intermediate bases (Sec- 
tion EH) 

(5) a new trace class estimate for GGT matrices that relies on AGR 
factorization (Section HUJ) 

(6) a reworked proof of the Killip-Nenciu jJU] theorem on the mea- 
sure that Haar measure on U(n) induces on Verblunsky coefficients 
(Section HU) 

(7) an argument of AGR is made explicit and streamlined to go from 
AGR to CM. factorization (Section [TUJ) 

It is a pleasure to thank M. Cantero, P. Deift, L. Golinskii, 
F. Marcellan, A. Martinez-Finkelshtein, L. Moral, I. Nenciu, P. Nevai, 
L. Velazquez, and D. Watkins for useful input. 

2. CMV Matrices: The Basics 

In this section, we define the CMV basis, the CMV matrix, and the 
CA4 factorization. 

CMV matrices can be thought of in terms of unitary matrices or 
OPUC. We start with the OPUC point of view. A measure <i/i in dV> is 
called nontrivial if it is not supported on a finite set; equivalently, if ev- 
ery polynomial, which is not identically zero, is nonzero in L 2 (dB>, dii). 
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Then one can define orthonormal polynomials, <p n {z) (or tp n (z, dfi)), by 

(i) (p n (z) = K n z n + lower order; n n > (2.1) 

(ii) ( p n ±{l,z i ^ i ... i z n - 1 } (2.2) 

We define the monic polynomials $ n (z) by $ n (z) = (p n (z)/K n . 
The Szego dual is defined by 



P*{z)=z n P n {l/z) (2.3) 

that is, 



n 



P n (z) = CjZ j =► P:(z) = J2 Cn-jZ j (2.4) 

j=0 j=0 

The symbol * is n-dependent and is sometimes applied to polynomials 
of degree at most n, making the notation ambiguous! 

Then there are constants {a n }^L in D, called Verblunsky coefficients, 
(sometimes we will write a n (dfi)) so that 

Pn ¥n+i{z) = z(p n {z) - a n ip* n (z) (2.5) 

where 

p n = (1 - K| 2 ) 1/2 (2-6) 
Moreover, \i — > {a n }™ =0 sets up a one-one correspondence between 
nontrivial measures on <9ID> and points of ©°° (as we will show below). 
(|2.5jl (called Szego recursion after [77]) and this one-one correspondence 
are discussed in |B9 ] 170]; see also [HE]- Applying * for P n +\ to (|2.5|) . we 
get 

Pn <Pn+l(z) = V*n( Z ) - OnZ<P n (z) (2.7) 

If one defines (of course, p = (1 — la) 2 ) 1 / 2 ) 



1 



2 —a 



then (|iDijl / (f27rjl can be written 



v<Pn+l/ \<Pn 

Since det(A) = z, we have 



— ^ (2.9) 



2! 



and thus 



p n <Pn(^) = (2.11) 

Pn <PnW = (V*n + li Z ) + a n ^n+l(^)) (2.12) 
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Introduce the notation [yi,---,yk] for the span of the vectors 
. . . , yk and P[ yi) ..., yk \ for the projection onto the space [2/1, . . . , 2/fc]- 
For x [2/1, . . . ,2/fe], define 

[*;vi,-,Vk]= iin"p" ] | X |i ( 2 - 13 ) 

the normalized projection of a; onto [2/1, . . . ,2/fc]" 1 , that is, the result of 
adding x to a Gram-Schmidt procedure. 
We define n n to be -P[i v .. j2 «-i]. 

By the definition of (p n and the fact that * is anti-unitary on Ran 7r n 
and takes z- 7 to z n ~\ we have 

VJb =[« b ;1,... ) « b - 1 ] < = [1;z,...,^] (2.14) 

With this notation out of the way, we can define the CMV basis 
{Xn}^=o and alternate CMV basis {x n }^ =0 as the Laurent polynomials 
(i.e., polynomials in z and obtained by applying Gram-Schmidt to 
1, z, z -1 , z 2 , z~ 2 , . . . and 1, z' 1 , z, z~ 2 , z 2 , . . . , that is, for k = 0, 1, ... , 

x 2 fc = i,z, ... , x2*-i = [^ fc ; 1, 2, • • • , z fe ~\ ^r fc+1 ] 

(2.15) 

x 2k = \z k \ 1, z~\ . . . , z^ 1 , 2T fe ] x 2fc _i = [2T fe ; 1, z~\ . . . , z" fc+1 , z*" 1 ] 

(2.16) 

So, in particular, as functions in L 2 (dU>, d/i), 

X n = Xn (2.17) 

and as Laurent polynomials, 



x n (z) = X n(l/z) (2.18) 

As realized by CMV [12], the {Xn}^=o an d {i n }^° = o are always a basis 
of L 2 (<9B, cf/i) since the Laurent polynomials are dense on C(<9D), while 
{ip n }^ =0 may or may not be a basis (it is known that this is a basis if 
and only if X]J a n| 2 = 00 > see Theorem 1.5.7 of |69|). On the other 
hand, the x an d x bases can be expressed in terms of ip and ip* by 
(|2.14|) and the fact that multiplication by zr is unitary. For example, 

ry> _ r -fcr 2k k fc-1 2fc-l 11 

= z- fc [z 2fc ;l,...,^- 1 ] 
= z~ k ip 2k (z) 

The full set is 

X2k(z) = z- k ip* 2k (z) X2k-i(z) = z- k+1 i f2k _ 1 (z) (2.19) 
x 2 k(z) = z- k <p 2k (z) x 2k ^(z) = z~Vm-i(*) (2.20) 
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Since \ and x are bases, the matrices of multiplication by z in these 
bases are unitary. So we have the unitary matrices 

Cml = {Xm,ZXi) ( 2 - 21 ) 

Cmi = (x m ,zx e ) (2.22) 

called the CMV matrix and the alternate CMV matrix, respectively. 
By (12.18)1 . the unitarity of C and z = z~ l , we see 

Cmk Ckm (2.23) 

that is, C and C are transposes of each other. We will see shortly that 
C is five- diagonal, but this follows now by noting that both z and z~ x 

map [xo, • • • , Xk] in to [xo, • • • , Xfc+a]- 

CMV [T2] . Ammar-Gragg-Reichel [5], Bunse-Gerstner and Eisner 
[TTj . and Watkins [H2] also discussed the important factorization C = 
CM as follows: 

£mk = (Xm, ZX k ) M m k = (x m , Xk) (2.24) 

Since {x k }f =l is a basis, (f,g) = Y,T=o(fi x k)(%k,g), and thus 

C = CM C = ML (2.25) 

The point of this factorization is that C and M have a simpler structure 
than C. Indeed, £ is a direct sum of 2 x 2 blocks and M of a single 
lxl block and then 2x2 blocks. 

One can (and we will) see this based on calculations, but it is worth 
seeing why it is true in terms of the structure of the CMV and alternate 
CMV basis. Notice that X2n-i and X2n span the two-dimensional space 
[1, z, z -1 , . . . , z n , z~ n ] fl [1, z, z~ x , . . . , z n ~ 1 , z - ™" 1 " 1 ]- 1 - and so do x 2 „_i and 
X2n- This shows that M is a direct sum of li x i and 2x2 matrices. 
Similarly, \2n and X2n+i span [1, . . . , z~ n , z n+1 ] D [1, . . . , z~ n+1 , z 11 }- 1 , as 
do zx2n and zx 2n +i (even for n = 0). Thus £ has a 2 x 2 block structure. 

In fact, we can use Szego recursion in the form (|2.5|) . 1)2.70 . ()2.11|) . 
([2.12)1 to find the 2x2 matrices explicitly. For example, taking ()2.12j) 
for n = 2m — 1, we get 

^2m = P2n-l<^2m-l ~ ^2m-l^2m 

and multiplying by z~ m yields (by TFXty I ^TM ). 

X2m — ~Ct2m-\^2m + P2m-l%2m-l 

This plus similar calculations imply 
Theorem 2.1 ( HI 1121 E2] ) • Let 

O(a) =(* (2.26) 
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Then C = CA4 and 

C = 0(a o ) © 9(a 2 ) © ®M © ■ ■ • © ©(«2m) © • • ■ (2.27) 

and 

M = li x i © e(ai) © e(a 3 ) © • • • © 0(a 2m +i) © ■ ■ ■ (2.28) 

Doing the multiplication yields 

/ a «ip PiPo 

Po -«i« -pi«o 

^ _ a 2 pi -a 2 a t 

p 2 pi -P2«i 


V 

C is five-diagonal, that is, only nonzero in those diagonals Ckk+j with 
j = 0, ±1, ±2. Notice that half of the elements with j = ±2 are zero, 
so it is only "barely" five-diagonal — and it cannot be tridiagonal or 
even four- diagonal since 

Proposition 2.2 ([El)- If {^-jk}i<j,k<oo is a semi-infinite unitary ma- 
trix and 

k-j £ {-l,...,n} A jk = 
then A is a direct sum of finite blocks of size at most n + l. 

This was proven for n = 1 in ^U] and conjectured for n = 2 in a 
draft of [ni] before motivating [T5] . 

While our construction has been for a n 's which come from a dp and, 
in particular, which obey 

\aj(dii)\ < 1 (2.30) 

defines a unitary so long as \a n \ < 1. We thus define a CMV matrix 
to be a matrix of the form ()2.25|) - ()2.29|) for any {a n }^L Q with \a n \ < 1. 
If \a n \ < 1 for all n, we call C a proper CMV matrix, and if |a n | = 1 
for some n, we call it an improper CMV matrix. 

To state the analog of Stone's selfadjoint cyclic model theorem, we 
need another definition. A cyclic unitary model is a unitary operator, 
U, on a (separable) Hilbert space, TC, with a distinguished unit vector, 
vq, which is cyclic, that is, finite linear combinations of {U n vo}'^ ) = _ 00 
are dense in 7i. We call the model proper if dim(7i) = 00 and improper 
if dim(7i) < 00. It is easy to see that the model is improper if and only 
if P(U) = for some polynomial, P, which can be taken to have degree 
dim(7i) — 1. Two cyclic unitary models, (H,U,v ) and (H,U,v ), are 







"3P2 P3P2 

-a 3 a 2 -p 3 «2 

OiApz — 0:4,0:3 



(2.29) 



10 



B. SIMON 



called equivalent if and only if there is a unitary W from 7i onto Ti so 
that 

Wv = v WUW' 1 = U (2.31) 

Theorem 2.3. There is a one-one correspondence between proper 
cyclic unitary models and proper CMV matrices, C, in that So = 
(1, 0, 0, . . . )* is cyclic for any such C and every equivalence class con- 
tains exactly one proper CMV model: (£ 2 ,C,Sq). 

Remarks. 1. There is behind this a four- fold equivalence: 

(i) equivalence classes of proper cyclic unitary models 

(ii) nontrivial probability measures on <9D 

(iii) Verblunsky coefficients {a n (dp)}^L in D°° 

(iv) proper CMV matrices. 

The spectral theorem sets up a one-one correspondence between (i) and 
(ii), while the definition of CMV matrices between (iii) and (iv). Szego 
recursion sets up a map from dp, to {a n (dp)}™ =1 . As we will show, each 
(£ 2 , C, So) is a cyclic model, so the key remaining fact is the uniqueness. 

2. A corollary of this is Verblunsky 's theorem (also called "Favard's 
theorem for the unit circle") that each {a n }^L G © is the Verblunsky 
coefficient for some dp,. See Id'H] for further discussion and other 
proofs. 

Proof. As explained in Remark 1, we need only prove that any proper 
CMV matrix has So as a cyclic vector, and that if {ai°^}^L are the 
Verblunsky coefficients for C and dp the spectral measure for So, then 

a n (dp) = o4 0) (2.32) 

Let S n be the unit vector in £ 2 with coefficient 1 in place n and else- 
where; index labelling for our vectors starts at 0. By direct calculations 
using the CM. representation, 

C n+1 S - p^p? ■ ■ ■ P { £s 2n+1 e [S , S 2n ] (2.33) 

(C*)% - Po° ] • • • pS_^ 2n G [S Q , 5 2n _i] (2.34) 

It follows that So is cyclic and 

Xn(dp) = WS n (2.35) 

where W is the spectral representation from £ 2 to L 2 (dV>,dp). ()2.35|) 
follows from ()2.33|) - (j2.34j) . induction, and the Gram-Schmidt definition 

of X- 

By <mSB and 

(S , CS ) = a { 0) (xo, zxo) = a (dp) 
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(X2n-2 ? ZX2n-l) — a 2n-lP2n-2 

(X2n,ZX2n-l) = «2n(^)P2n-l (dfi) 



□ 



3. Cutoff, Finite, Two-Sided, Periodic, and Floquet CMV 



In this section, we will discuss various matrices constructed from or 
related to CMV matrices. Some are finite, and in that case, we will 
also discuss the associated characteristic polynomial which turns out 
to be equal or related to the basic ordinary or Laurent polynomials of 
OPUC: the monic orthogonal and paraorthogonal polynomials and the 
discriminant. The basic objects we will discuss are: 

(i) Cutoff CMV matrices, that is, ii n Cii n where n n is projection onto 
the span of the first n of 1, z, z~ l , .... 

(ii) Finite CMV matrices, the upper nxn block of an improper CMV 
matrix with a n _i G dD. 

(iii) Two-sided CMV matrices defined for {(y n }'^ = _ 00 via extending C 
and Ai in the obvious way to a two-sided form. 

(iv) Periodic CMV matrices. The special case of two-sided CMV ma- 
trices when a n+p = a n for some p. 

(v) Floquet CMV matrices. Periodic CMV matrices have a direct in- 
tegral decomposition whose fibers are pxp matrices that are finite 
CMV matrices with a few changed matrix elements. 

Cutoff CMV matrices. A cutoff CMV matrix is the restriction of a 
proper CMV matrix to the upper nxn block, that is, top n rows and 
leftmost n columns. We use to denote the cutoff matrix associated 
to C. A glance at ()2.29|) shows that depends on {otj}™Zo- Here is 
a key fact: 

Proposition 3.1. Let <& n (z) be the monic orthogonal polynomial asso- 
ciated to C (i.e., $ n = K^ipn). Then 



Proof. If 7i n is the projection onto [1, . . . , 2 n_1 ] and n n on the span of 
the first n of 1, z, z~ x , . . . , then ir n and n n are unitarily equivalent under 
a power of z. So if M z f = zf, then n n M z 7T n and 



Matrices 



$ n (z) = det(zl - C (n) ) 



(3.1) 



TT n M z % n EE C< n > 

are unitarily equivalent, and thus, ()3.1|) is equivalent to 

$ n (z) = det(>l - n n M z 7i n ) 



(3.2) 



(3.3) 
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Let Zj be a zero of $ n of multiplicity kj and let Pj(z) = $ n (z)/(z — 
Zj) kj . Then with A = n n M z 7[ n , we have 

(A - Zjf'P, = (A - Zjf '/' ; ^ 

Thus, as Zj runs through the distinct zeros, {(A — Zj) l Pj \ £ = 
0, 1, . . . , kj — 1} gives us a Jordan basis in which A has a fcj x fcj block 
for each Zj of the form 

fzj 1 0\ 

z, 1 ... 



1 

\0 zj 

and thus 

det(z - A) = l[(z - Zj ) k i = $ n (z) □ 
Corollary 3.2. The zeros of $ n (z) lie in D. 

Remark. See Section 1.7 of [HH] for six other proofs of this theorem. 

Proof. Let A = 7r n M z 7T n . Then < 1, so obviously, eigenvalues lie 
in D. If Ar] = z r] with r] G Ran7r n and z G <9D, then \\Ar]\\ = \\t)\\, so 
7i n zr] = zrj and thus as polynomials (z— z )r] = 0. Since the polynomials 
are a division ring, rj = 0, that is, there are no eigenvalues on <9D. □ 

To classify cutoff CMV matrices, we need to understand how 0(a) 
arises from a 2 x 2 change of basis. 

Lemma 3.3. Let f,g be two independent unit vectors with 

{f,g) = <* (3-4) 

Let <pi, (p2 be the result of applying Gram-Schmidt to f, g and ipi, ip2 to 
g, f. Let M be the matrix of change of basis 

fi=m 11 ^ 1 +m 12 iJ 2 (3.5) 

(p 2 = m 2 iipi + m 22 %l) 2 (3.6) 

Then a G D and M = 0(a). 

Proof. \a\ < 1 by the Schwarz inequality and the independence of / 
and g. Note that 

\\9- (f,9)f\\ 2 =\\9\\ 2 + \(f,9)\ 2 \\f\\ 2 -2Re[(f,g)(g,f)} 

-i I 1 2 2 

= 1 — \a\ — p 

so 

<p 2 = p- 1 (g-af) (3.7) 
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ip 2 = p- 1 (f-ag) (3.8) 
From this, a direct calculation shows that 



mn = 




<pi) 


= = a 






m 12 = 




<pi) 


= p- 1 (i-N 2 ) 


1 = P 




m 21 = 






= p- 1 (l-N 2 ) 


1 = P 




m 22 = 


<V> 2; 


¥>2> 


= p~ 2 (a + a\a\ 


2 -2a) = -a 


□ 



Remark. One can use this lemma to deduce the form of C and M in 
the CM factorization without recourse to Szego recursion, and then use 
their form to deduce the Szego recursion. This is precisely Watkins' 
approach to the factorization and Szego recursion. 

Given any matrix A and vector Sq, define 

V k = span(5 , A5 , A*5 , A k 5 , (A*) k 5 ) (3.9) 

which has dimension 2k + 1 if and only if the vectors are independent. 

Here is a complete classification of cutoff CMV matrices analogous 
to Theorem 12.31 

Theorem 3.4. Let A be an n x n cutoff CMV matrix with 5q = 
(1,0,..., 0)*. Then: 

(i) If n = 2k + 1, Vk has dimension n. If n — 2k, then span[\4_i U 
{A fc 5o}] has dimension n. 

(ii) If n = 2k + 1, A* is an isometry on span(\4_i U {{A) k 5 }) and 
A is an isometry on span(\4_i U {{A*) k 5o}) . If n — 2k, A* is an 
isometry on span(\4_2 U {A 5q, A k 5o}) and A is an isometry on 

(hi) \\A\\ < 1. 

(iv) A is not unitary. 

Conversely, if Aq, 5q are a pair of an n x n matrix and vector Sq 
obeying (i)-(iv), then there is a basis in which So — (1, 0, . . . , 0)' and A 
is a cutoff CMV matrix. 

(A, Sq) determine the Verblunsky coefficients (a , . . . , a n _i) uniquely. 
In particular, two cutoff CMV matrices with distinct {aj}" =0 are not 
unitarily equivalent by a unitary preserving 5$. 

Proof. Suppose first that A is a cutoff CMV matrix, that is, A = 7r n C7r n . 
By definition of 7T n , 

n n C j 5 = C j 5 n n (C*yS = (C*)% 

for j = 0, 1, . . . , k and £ — 0, 1, . . . , k (resp., k — 1) if n = 2k + 1 (resp., 
2k). It follows that for those values of j and £, 

&6 = A^So (C*) e 5 = (A*) e 5 (3.10) 
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so that (i) holds. 

This also shows that A*A>8q = A^~ 1 5q for j = 1, . . . , k, and from 
this and (j3.10j) . it follows that A* is unitary on span{ A^S } k =0 U 

{(A*Y5q}^~q ^ or k ~ 2 \ Similarly, we get the unitarity result for A. 

(hi) is obvious since ||7r n || = ||C|| = 1 and (iv) follows since there is a 
vector (p in Ran(7r n ) with 7T n C(p ^ dp. This completes the proof of the 
first paragraph of the theorem. 

As a preliminary to the converse, we note that 7r n commutes with 
either £ or M, so a finite CMV matrix has the form C n M n where if 
n = 2k + 1 is odd (1 = 1x1 identity matrix), 

C n = 0(oo) © • ■ ■ © 0(a 2fe - 2 ) © a 2k l (3.11) 

M n = 1 © e(ai) © ■ • • © 6(a 2fc _i) (3.12) 

and if n = 2k is even, 

C n = e(a ) © • • • © e(a 2fe - 2 ) (3.13) 
M n = 1 © 0(ai) © • • • © 6(a 2fc _ 3 ) © a 2fe _il (3.14) 

We will prove that when A obeys (i)-(iv), then A has an C n M. n factor- 
ization with parameter atj given intrinsically by A. This will not only 
prove the converse but the uniqueness of the map from {aj}^ 1 to 
cutoff CMV matrices, and so it will complete the proof of the theorem. 

We first consider the case n = 2k + 1 odd. Define xe t° be the 
basis obtained by Gram-Schmidt on 6 , A6 , A*5 , . . . , A k 6 , (A*) k 5 
(this is possible because (i) implies these vectors are linearly in- 
dependent) and define xe to be the result of Gram-Schmidt on 
5 , A*5 , A5 , . . . , (A*) k 5 , A k 5 . Then if A is written in xi basis, 

A = CM (3.15) 

where 

Mm = (xk,Xi) £>M = (Xk,Ax £ ) (3.16) 
We need to show that C,M have the form (l3~Tll / (ETHZJ) . 

If P m is the projection to the orthogonal complement of V m _i and 
/ = P m ^(A*) m 5 /\\P m ^(A*) m 5 \\ and g = P^A^o/ Pm-i^oll , 
then {x£, x i}e=m,m+i are given by Lemma So M has the form 
1 © 0(ai) © • • • © 0(a 2 fc~i) as required. 

Let We be the projection onto the span of the first 2£ 
of 5 , A5 , A*5 , A 2 5 , . . . and W e the span of the first 2£ of 
5o, A*5o, A5o, (A*) 2 5o, ■ ■ ■ ■ By hypothesis (ii), A is an isometry on 
Wi, W 2 , ■ ■ ■ , Wk, and by the same hypothesis, AA*ip = <p for ip = 
5 , A*5 , . . . , (A*) k 5 . So it follows that A maps W t to Wt for i = 
l,...,k. Thus, by Lemma 13.31 the 2k x 2k upper block of L is 
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0(«o) © @(«2) © • • • © 6(«2fc-2)- Since A and A* are contractions, 
L must have O's in the bottom and rightmost column, except for the 
lower corner. That corner value, call it a>2k, must have |«2fc| < 1 by (iii) 
and \a 2 k\ < 1 by (iv). Thus, we have the required LM. factorization if 
n = 2k + 1. 

Now let £ = 2k be even. Define xt as before, but define xe by Gram- 
Schmidt on A8 , S , A 2 5 , A*5 , A k 5 , (A*) k - 1 5 . Then A written in 
Xe basis has the form (J3.15)) where 

M k e=(xk,Axe) Cke=(Xk,Xi) (3.17) 
We need to show that C,M have the form $Hty / $nty . 

Define We to be the span of 5 Q , A5 , A*5 , . . . , A l 5 , (A e )*5 and W e 
the span of A80, 5o, . . . , (A*Y~ 1 5o, A 1+1 5q. As above, A is an isometry of 
We to W e , so M has the form l®6(o; 1 )©- • -0 0(0:2^-3) ©a 2 fc-il where 
|a 2 fc_i| < 1 by condition (iv). Similarly, L has a 6(a )©- • -©@(tt2A:-2) 
block structure. This proves (|3.13j) / (|3.14j) and completes the case n = 
2k. □ 

Remark. This theorem sets up a one-one correspondence between 
{ctjYjZl e D n and cutoff CMV matrices. 

Finite CMV matrices. As discussed in Section |21 C, originally 
defined for |<x, | < 1, has an extension to |<x, | < 1 via the formula 
for C,Ai. Since \atj \ = 1 implies pj = and 0(<x,- o ) is diagonal, if 
|aj | = 1, C({aj}) leaves C- 7o+1 (i.e., vectors if with ip k = if k > jo + 1) 
invariant and C \ a+1 is a (jo + 1) x {jo + 1) unitary matrix. If 
|ao|, • • • , |o; n _2| < 1 and |a n _i| = 1, the corresponding n x n matrix 
is called a finite n x n CMV matrix, C n ({ao, . . . , a n -2, a n _i}). C n has 
the form C n M n where fCTTTIt / (HTT^It or (|3^ / (|3jlt hold, and now 
G 83. 

If U is an n x n matrix and 5q a cyclic vector in the sense that 
{U m 5 }™ = _ 00 is total, S cannot be orthogonal to any eigenvector. So U 
has to have n distinct eigenvalues {Aj}™ =1 and the eigenvectors {V'jljLi 
obey |(^o, ^j)\ 2 = 7^ 0. The unitary invariants of the pair (U, 5q) are 
the spectral measures Y^=i a j^^ where {Ay}" =1 are arbitrary distinct 
points and the dj > have the single restriction Y^j=i a j = 1- Thus, 
the number of real parameters is n + (n — 1) = 2n — 1. The number 
of free parameters in an n x n finite CMV matrix is n — 1 complex 
numbers in © and one in 9D, that is, 2(n — 1) + 1 = 2n — 1. This 
suggests that 

Theorem 3.5. There is a one-one correspondence between unitary 
equivalence classes of n x n unitary matrices with a cyclic vector and 
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finite CMV matrices in that each equivalence class contains one CMV 
matrix {fixed by 8q — (1, 0, . . . , 0)') and two CMV matrices with distinct 
parameters are not unitarily equivalent by a unitary fixing (1, 0, ... , 0)*. 

The proof is identical to the proof of Theorem 13.41 except that A 
nonunitary is replaced by A unitary so = 1. As noted in the 

discussion after Lemma 13.31 this approach is close to that in Watkins 
[52] . This theorem is related to results in Ammar-Gragg-Reichel jS] 
and Killip-Nenciu The latter talk about matrices with CMV 

shape having the CMV form. 

Instead of the cutoff CMV matrix, ir n M z ii n , one can look at n n M z 7r n 
where n n is a not necessary selfadjoint projection. CMV ^3] have 
shown that finite CMV matrices have this form and that they are the 
only normal operators of this form. 

Two-sided CMV matrices. In a sense, CMV matrices are two- 
sided. For example, if a n = 0, C is unitarily equivalent to a two-sided 
shift since C k 5o = 52k-i and C~ h 5o = 62k- However, as structures, the 
matrix is semi-infinite and there is a cyclic vector which is often not 
true for two-sided matrices. Thus, there is an extension to "two-sided" 
examples. 

Let {an}^-oo be a two-sided sequence of numbers in D. Let H = 
^(Z), that is, two-sided sequences {wn}^_oo with X^-ool^l 2 < 00 • 
Let 0j(/3) be 0(/3) acting on the two indices j and j + 1. Define 

= JM{a,}f = _J (3.18) 

where 

00 

e 2H K'-i) 

j=-oo 
00 

j=-oo 

£ is called the extended CMV matrix. 

The extended CMV matrix was introduced in [f23 • Earlier, Bour- 
get, Howland, and Joye [TU] had considered some doubly infinite five- 
diagonal matrices which factor into a product of two direct sums of 
2x2 matrices, but the 2x2 blocks were general unitaries rather than 
6's. 

While £ is natural and important for the periodic case, we will also 
see that it arises in the theory of essential spectrum of C (see Section |HJ). 
One reason for the name "extended CMV matrix" is: 
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Proposition 3.6. Ifot-i = — 1? then £ is a direct sum on £ 2 (—oo, — 1]© 
£ 2 [0, oo) and S \ £ 2 [0, oo) is the CMV matrix C({a i }~ ) . Moreover, 
£ \ £ 2 (—oo, — 1] is unitarily equivalent to C({a_j_ 2 }'jLo) ■ 

Remark. £ 2 [0, oo) means those w £ £ 2 (Z) with u n = if n < and 
£ 2 (— oo, —1] those with u n = if n > — 1. 

Proof. 0(-l) = (V?)> so both £ and leave ^ 2 [0,oo) and 
t 2 {— oo, — 1] invariant. Thus, £ does. 

.M t ^ 2 [0,oo) = M and £ f ^ 2 [0,oo) = £, so £ \ £ 2 [0,oo) is 

n{n ; }; () ). 

For the restriction to £ 2 (— oo, —1], note first that (? J) @( a ) (? o) = 
0(— a). Thus, by labeling the basis backwards, £ is unitarily equiva- 
lent to something that looks very much like C({— a_j_2}j2 =0 ) except A4 
starts with —1, not 1. By the discussion in Section there is a unitary 
that flips the spin of this —1 and all the a/s. □ 

Changing a_! from its value to = — 1 is a perturbation of rank at 
most two, so by the Kato-Rosenblum theorem [HS], the a.c. spectrum 
of £ is that of a direct sum of two C's. Since these a.c. spectra are 
only restricted by simplicity, we see that the a.c. spectrum of £ has 
multiplicity at most 2, but is otherwise arbitrary: it can be partially 
multiplicity 0, partially 1, and partially 2. In particular, £ may not 
have a cyclic vector. 

It is a theorem of Simon jU7] that the singular spectrum of £ is 
simple. This is an analog of a theorem of Kac [J7J HI] and Gilbert 
jS3 EH] for Schrodinger operators. 

Periodic CMV matrices. If {aj}JL Q is a sequence of Verblunsky 
coefficients with 

a J+ p = oij (3.19) 

for j > 0, p fixed, and > 1, then aj has a unique extension to j £ Z 
obeying ()3.19|) . The corresponding £ is called a periodic CMV matrix. 
The theory is simpler if p is even, which we henceforth assume. As 
explained in [70], there are several ways to analyze odd p once one has 
understood even p. 

Associated to {aj}^Zl is a natural Laurent polynomial, called the 
discriminant, A(z; {a^}^,), 

A(z) = z- p/2 Tr(A(« p _!, z)A(a p - 2 , z) . . . A(a , z)) (3.20) 

where 

z —a 
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This is analyzed in Section 11.1 of [ZD]- A(z) is real on <9D and has 
positive leading coefficient. This means A (z) has p real parameters. 
This suggests the map from {<x,}j =0 (of real dimension 2p) to A is 
many to 1, with inverse images generically of dimension p (= 2p — p). 
This is in fact true: the inverse images are tori of dimension d < p (we 
will see what d is in a moment). They are called isospectral tori. 

For fixed {aj}^Z , A _1 ([— 2,2]) (which is the spectrum of £) 
lies in <9D and is naturally p closed intervals whose endpoints are 
A _1 ({— 2,2}). Generically (in a), the intervals are disjoint, that is, 
their complement (called the gaps) is p nonempty open intervals. In 
general, the number of open intervals in the gaps is d, the dimension 
of the isospectral torus. 

Floquet CMV matrices. If T: £ 2 (Z) -> f(Z) by (Tu) n = u n+p and 
p is even, and if a n = a n+p , then TMT~ X = M. and TCT^ 1 = C, so 

TST- 1 = £ (3.22) 

(We will not consider odd p in detail, but we note in that case, 
TMT- 1 = I and TCT' 1 = M so, since M 1 = M and l l = I (on 
account of G* = where * is transpose), we have that T£T~ X = £ l .) 

Since T and £ commute, they can be "simultaneously" diagonalized, 
in this case represented on a direct integral representation. One way 
of making this explicit is to define, for each (3 G <9IB>, the space 
the sequences {M n }^_oo obeying u n+p = [3u n . This is clearly a space 
of dimension p since {wri}£ ! L_ 00 mapping to {u n }^=o (i.e., restriction) 
maps if to C p . 

By ()3.22|) . £, which maps bounded sequences to bounded sequences, 
maps £f to if, and so defines a finite-dimensional operator £ p (j3) under 
the explicit relation of if mentioned above. One sees 

£M = C p M p (f3) (3.23) 

where 

C p = e (a ) © • • • © e p _ 2 (a p _ 2 ) (3.24) 

M p {6) = 0! (a) © ... © 6 p _ 3 K- 3 ) © Of i(ap-i) (3.25) 

where Op_ x (a) acts on <5 p _i and Sq, and in that (ordered) basis has the 
form 

{% 1) < 326 > 

£ P {I3) is called the Floquet CMV matrix. To make precise the con- 
nection to £, we define the unitary Fourier transform JF: £ 2 (Z) — > 
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L 2 (<9©, g; C p ), the set of L 2 functions on <9© with values in C p by 

oo 

(Fu) k (J3) = P^+nv ( 3 - 27 ) 

n=— oo 

Then 

(rer- 1 g){p) = £ p (p) g (p) (3.28) 

(For details, see Section 11.2 of [70].) 

Finally, we note a general relation of the eigenvalues of £ p (/3) and 
the discriminant, A(z), of ()3.19j) . For 2 G <9ID) is an eigenvalue of £ p (/3) 
if and only if there is (ui,uqY so that after a p-step transfer, we get 
/3(ui,uoY, that is, if and only if z^ 2 (3 is an eigenvalue of T p (z ). This 
is true if and only if Zq 2 T p (zq) has eigenvalues (3 and (3~ l if and only 
if A(zb) = P + P~ 1 . It follows that 

det(z-^,C9)) = (n^) [^ /2 [AW (3.29) 

for both sides are monic polynomials of degree p and they have the 
same zeros. 

4. CMV Matrices for Matrix- Valued Measures 

Because of applications to perturbations of periodic Jacobi and CMV 
matrices jT^], interest in matrix- valued measures (say, k x k matrices) 
has increased. Here we will provide the CMV basis and CMV matri- 
ces in this matrix-valued situation; these results are new here. Since 
adjoints of finite-dimensional matrices enter but we want to use * for 
Szego reversed polynomials, in this section we use ' for matrix adjoint. 

Measures which are nontrivial in a suitable sense are described by 
a sequence {aj}^L of Verblunsky coefficients that are k x k matrices 
with ||oj|| < 1. 

To jump to the punch line, we will see that C still has an CM. 
factorization, where 0(a) is the 2k x 2k matrix 

6(a) Q (4.1) 

where 

p L = (l-at«)V2 ^ = (i_ aa t)i/3 (4.2) 
It is an interesting calculation to check that is unitary, that is, 

(; (;« C) - {i s) <«» 
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That aa f + (p R ) 2 = 1 = (p L ) 2 + a f a follows from (jOjl . That ap L - 
p R a = p h a* — a^p R = follows by expanding the square roots in (|4.2j) 
in a Taylor series and using a(a^a) m = (aa^) m a. 

To describe the model specifically, we have a kxk matrix- valued (nor- 
malized, positive) measure which can be described as follows: d[i t {9) 
is a positive scalar measure on dH> and for a.e. (dfi t (9)) a matrix A{9) 
obeying 

A(6) > Tr(A(6)) = 1 (4.4) 

We write dfj,(6) = A(6) d/j,t(9). We assume dfi is normalized in the sense 
that J A{9) dfit = 1- We will consider to be the kxk matrix- valued 
functions, /, on dB> with 

J Ti:(/(0)U(0)/(0)) dm(9) < oo (4.5) 

The measure dfi is called nontrivial if 

dimfspan^^}^ 1 ] = nk 2 (4.6) 

for each n. Equivalently, for each n and {Be}™~y in £(C fc ), we have 
S&To = in implies B = B± = ■ ■ ■ = B n _i = 0. Also 
equivalent is that (<p, A(6*)<p) dfit(9) is nontrivial for each <p e C fe \{0}. 
Similarly, we define 7Yl to be /'s with 

J TT(f(9)A(9)p(9))df, t (9) <oo (4.7) 

It is easy to see that nontriviality implies (|4.6|) holds also in Ti^. 

We define two "inner products," sesquilinear forms from and Hl 
to C(C k ), the kxk matrices: 

«/,0»r= / f\9)dn{9)g{9) (4.8) 

«/,<?»l = | g(9)dn(9)p(9) (4.9) 

The right side of ()4.8|) is shorthand for 

| p(0)A(0)g(0)dm(9) 

so the LHS of JOJ) is Tr(((f, g)) R ). The symbols L, R (for left and 
right) come from 

((f,gB)) R =((f,g))B (4.10) 
({f,Bg)) L = B((f,g)) (4.11) 



for constant kxk matrices, S. 
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The normalized matrix OPUC, ip^, (p^, are polynomials in z of degree 
n with matrix coefficients with 

«<Pn, v£»R = <W (&n, ^»R = <W (4-12) 

This determines </2 uniquely up to a unitary right (resp., left) prefactor. 
We will pick this prefactor by demanding 

<Pn' h {z) = i^' L z n + lower order (4.13) 
«n+i(^) -1 >0 K)- l K^ +1 >0 (4.14) 

With this choice of normalization, one has a sequence of k x k matrices, 
{a„}™ =0 , and the recursion relations 

z<p L n = p L n p L n+1 + aU^y (4.15) 

= <Pn + lPn + {Vlfoi (4.16) 
(^r = (^ + l)^+*n (4-17) 
(<Pn)* = + <Wn (4.18) 

where are given by (|4.2|) and i^(-z) = z n P n (l/z)^ . For construc- 

tion of <^' R and proof of (1073) (ETTS)). see [Zj or Section 2.13] 
following Delsarte et al. jEj and Geronimo |29j . 

It will help to also have the following, which can be derived from 

= pfcrf - (4-19) 

(A-i)* = Pl^PD* - anvtn (4-20) 

Following and (l2~27H) . we define the CMV and alternate CMV 

basis by 

X2k {z) = z- k ( V l(z)T X 2k-i(z) = ^Vk-iC*) (4.21) 
x 2k (z) = z~\\ k {z) x 2k ^(z) = z- k {$k-i{z)T (4-22) 

Proposition 4.1. {xfX z )}e^o an< ^ { x t{ z )}^=o are ((' > orthonormal, 
that is, 

{{Xli Xm))h = Sim ((%£, X m ))h = S im (4.23) 

Moreover, \i is ^ n the module span of the first i ofl,z, z -1 , . . . and X£ 
ofl,z-\z, .... 

Remark. By module span of {fj{z)}™ = i of scalar functions, /, we mean 
elements in Hl of the form YlT=i Bjfj(z) where Bi, B 2 , . . . are fixed 
k x k matrices. 
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Proof. (JOB} for £ = m holds by (|3T2j) and (g2D), if we note 

that 

= «g„, p„» R (4.24) 

It is obvious from the definition of xi an d xi that they lie in the proper 
span. To get ()4.23|) for £ < m, we need to know that \t * s orthogonal 
to the first £ — 1 of 1, z, z~ l , . . . and xg to the first £ — 1 of 1, z, z~ l , .... 
For cases where xe or xe is given by (p L , this follows from ((z k , (p\)) = 
for < k < £ and when it is a (pf)* from (|4.24|) which says 

((z k ,(ipfy)h = ((tpf,z e - k )) = o 

for < £ - k < £. □ 

By a (left-) module basis for Hh, we mean a sequence {/j}j*i or- 
thonormal in ((• , that is, ((fj, fe))c = Sj£ so that as {Bj}jL runs 
through all iV-tuples of k x k matrices, 52j=o ^j'/i * s a sec L uence of sub- 
spaces whose union is dense in 7i L - For any such basis, any r\ e Hl 
has a unique convergent expansion, 

oo 
3=0 

{Xj}^=o an d { x i}j^=o are both module bases. That means, if Cj£ is 
defined by 

C je =(xj,zxt) (4.26) 

then the matrix, obtained by using the k x j blocks, Cji, is unitary. 
Moreover, 

C je = Y, C jrnM me (4.27) 
1/1 

where 

£ji = (Xj, zx e ) M jt = (xj, xe) (4.28) 
In (|4.19jh set n = 2k — 1 and multiply by z~ k to get 

X2k = ~alx2k + PnX2k-l (4.29) 

where a bottom row of is clear. In this way, using (j4.15|) . (j4.18j) . 
Of . and lj£20|) . one obtains: 

Theorem 4.2. ©^(a) (/wen &y ()4.1|) acting on C 2k corresponding 

to 5j, we have 

M = lixi © ©i(«i) © © 3 (tt 3 ) © ■ ■ • 
£ = e (a ) © e 2 (« 2 ) © 6 4 (a 4 ) © ■ ■ • 
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The analog of (E| is 



/ 



C 



\ 



Po 








— «o«{ 

R t 
pf«2 

Pfpf 





p\p\ 
-ttoPi 
—a\a\ 
-atipf 







P2"3 

— a2«3 
pfa\ 






P2P3 

-«2p3 

-a 3 a\ 



(4.30) 



We note for later purposes that for this matrix case, the GGT matrix, 
which we will discuss in Section EB has the form 



-ttfc-lPfcPfc+1 
,R 



■ Ol 



< k < 







that is, 



Po 



Po a \ 



— aoa 1 
Pi 



PoPl"2 

— aia\ 



t 



k = l 
k > 1 

P0P1PW3 

-«0PlP2«3 



+ 2 
+ 2 

\ 



(4.31) 



(4.32) 



5. Rank One Covariances 

For selfadjoint matrices, the most elementary rank one perturbations 
are diagonal, that is, J h> J + X(S n , ■ )6 n , where 5 n is the vector with 1 
in position n and elsewhere. The impact of such a change on Jacobi 
parameters is trivial: a m — > a m , b m — > b m + X5 nm (if we label vectors 
in the selfadjoint case starting at n = 1). One of our goals is to find 
the analog for CMV matrices, where we will see that the impact on 
Verblunsky coefficients is more subtle. 

We will also address a related issue: In the spectral theory of OPUC, 
the family of measures, dp\ with a n {dp\) = \a n for a fixed {a n }™ =0 , 
called an Aleksandrov family, plays an important role analogous to 
a change of boundary condition in ODE's. If <p n are the normalized 
OPUC, the GGT matrix, 



L=o) 



(5.1) 

(see [HI 
is easily 



Gke({a n } c r 

has the property that ^({Aanj^o) — Q \{a n }'^' =0 ) is rank one 
page 259]). But for the CMV basis, C({\a n }™ =0 ) -C({a n }™ 
seen to be infinite rank (if the a's are not mainly 0). However, we 
will see here that for a suitable U\ (depending on A but not on a!), 
U x C{{\a n }™ = )U^ - C({a n }™ =0 ) is rank one. 

We need to begin by figuring out what are natural rank one pertur- 
bations. The key realization is that the proper format is multiplicative: 
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Let P be a rank one projection and We = e ldP = (1 — P) + e ld P. Then 
Wg — 1 = (e* 61 — 1)P is rank one, and for any U, UWg is a rank one per- 
turbation of U. It will be convenient to parametrize by A = e ld e <9B. 
Thus, we define 

W^\X) = l + (e ie -l)(5 m ,-5 m ) (5.2) 

and given any CMV matrix C, we let 

C m {\) =CW (m) (\) (5.3) 

We will use C m (A; {a^}) where we want to make the a-dependence 
explicit. Notice that 

I XLik if k = m 

that is, we multiply column m by A. 

Part of the result we are heading towards is that 



A -1 a/(C) I > m 



In particular, C°(A) realizes the fact that C({Aafc}^ ) is unitarily equiv- 
alent to a rank one perturbation of C({ak}^ =0 ). ()5.5j) for the important 
case m = is due to Simon (HHl Theorem 4.2.9] and for the general 
case to Simon [72]. We will sketch the various proofs. 

While we will eventually provide explicit unitaries that show 
C m (A; {<x,-}°fl ) is unitarily equivalent to C (right side of (|5.5p ). we 
begin with a direct proof of ()5.5|) in case m = 0. 

Theorem 5.1. C m=0 (A; {<x,-}!*L ) /ias Verblunsky coefficients 
{A •«,•}?„■ 

Remark. If ,M (A) = Al © 0(ai) © 6(« 3 ) © • • • , that is, the 1 in the 
upper left corner is replaced by A, then CAi^ = CW^°'(\). 

Sketch. (See Theorems 4.2.9 and Subsection 1.4.16 of |69j.) By defini- 
tion, 

C rn=0 _ C = ( A _ 1 ) C p o ( 5 g) 

where Pq = (So, ■ )5q. Define, for z EB>, F\ and the f\ by 

FaM = (So, (Cr° - z)(CT° + (5.7) 

= ^TTT < 5 ' 8 > 

1 - zf\{z) 
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Using the second resolvent formula and (|5.fij) implies (see Subsec- 
tion 1.4.16 of HH) that 

f x (z) = A-VW (5-9) 

The Schur algorithm and Geronimus theorem (see Chapter 3 of jSHl) 
then imply ()5.5|) for m = 0. □ 

For discussion of the movement of eigenvalues under the perturba- 
tions of Theorem 15.11 see [UJE3I2II] and Theorem 3.2.17 of 
The key to an explicit unitary equivalence is the following. Let 

-w - (j °) HX) = (o ?) (510) 

Then, by a simple calculation, 

u{X)Q{\- 1 a)u{X) = A6(a) (5.11) 

z>(A)- 1 0(A- 1 a)z>(A)- 1 = A^G^) (5.12) 

Note that (j5.11j) does not use z/(A) and ^(A) -1 but u(X) in both places. 
Similarly, ()5.12j) has zz(A)" 1 in both places. In the full calculation, one 
does not use 

UCMU- 1 = (UCU-^iUMU- 1 ) 

but rather 

UCMU' 1 = {UCU^U^MU- 1 ) (5.13) 
We need a notation for diagonal matrices. D(l 2k (l\)°°) indicates 
the diagonal matrix with entries 1 2k times, then alternating l's and 
A's. Thus, 

W {m \X) = D(l m Xl°°) (5.14) 
Using ()5.14|) . (|5.13|) . (|5.1ip . and ()5.12|) . a direct calculation (see Sec- 
tion 5 of [72]) shows: 

Theorem 5.2. For n = 0, 1, 2, . . . , define 

Um-i = D(l 2k (lXD (5.15) 
U 2k = D(X 2k (lX)°°) (5.16) 



where 



Then 



ft = C 3 > " < 5 ' 18 > 

Xatj j > n 



U n C(T niX -i({ aj }™ ))U^ = C({a 3 }f =Q )W^\X) (5.19) 
In particular, ()5.5j) holds. 



2(> 
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Remarks. 1. It is important that 5q is an eigenvector of U n since 
Verblunsky coefficients involve a unitary and a cyclic vector. (J5.f 9j) 
also shows that CW (jl \\) has 5 cyclic vector. 

2. One can also ask about Verblunsky coefficients of 
W^ n \X)C{{aj}'^L Q ). Since Verblunsky coefficients are invariant under 
unitaries that have 5q as an eigenvector and 

w {n) c = w { - n) cw { - n \w {n) y l 

the Verblunsky coefficients of CW^ and W^ n 'C are the same. 

(J5.11|) . (J5.12|) . and ()5.13|) imply a result about extended CMV ma- 
trices. For A G dH>, let W(\) be the two-sided diagonal matrix with 
d,2j = 1, = A. Then 

Theorem 5.3. Let A G 00. Then W(A)£({aJ^_ JW(A)" 1 = 
£({Aa n }J 



Remark. In particular, spectral properties of ^({an}^.^) and 
^{Ao^}^!,^) are identical and a n — ► Act„, preserves isospectral tori 
in the periodic case. 

6. Resolvents of CMV Matrices 

In this section, we will present formulae for the resolvent of C anal- 
ogous to the Green's function formula for Jacobi matrices (see The- 
orem 4.4.3 of [EH!)- These formulae appeared first in Section 4.4 of 
[UUj . Similar formulae for GGT matrices appeared earlier in Geronimo- 
Teplyaev [33] (see also |3U E21) 

Clearly, we need an analog of Jost solutions. For OPUC, these were 
found by Golinskii-Nevai [41 who proved 



Theorem 6.1. Fix z G D. Let ip n be the normalized OPUC for a prob- 
ability measure dfi ond3, andip n the normalized OPUC for Verblunsky 
coefficients —a n (dfi) (so-called second kind polynomials) . Then 

oo 

J2 + Hz)Vn{z)\ 2 + \r n {z) - F(z)<p* n (z)\ 2 < OO (6.1) 

n=0 

where F is the Caratheodory function: 

F{z) = J (6-2) 

Remarks. 1. This is an analog of Weyl's formula; see (1.2.53) of [5T?] . 
2. See [JT] or [MJ Section 3.2] for a proof. 
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With this in mind, we define 



(6.3) 



z ip2£ n = 2£ 

■z~ l i>u-\ n = 2£-l 

z wul-1 n — 21 — 1 



Pn = Vn + F(z)x n (6.5) 

7T n = T„ + ^(^Xn (6-6) 

Then Theorem 4.4.1 of [HH] says: 
Theorem 6.2. VFe /jai>e /or z£D, 

x f(2z)- 1 xK^)Pfc(2 ; ) A; > ^ or A; = £ = 2n- 1 

K 2j jM |(2^)- 1 ^(^)x fc (z) £>£;or£; = £ = 2n l ' J 

As a special case, since 5 n = Xn(C)5o and |Xn(e* e )| = \ip n (e )\, we 
obtain from a spectral representation 



«e*)| 2 



d/x(0) = (2z n+1 y 1 Vn(z)[-r n (z) + F(z)tp'(z)] (6.8) 



As shown in remarks to Theorem 9.2.4 of 70^, this is equivalent 
to a formula of Khrushchev [35] for J ^^ |(^ n (e t6> )| 2 d/i(9). For an 
application of Theorem 16.21 see Stoiciu 



7. l p Perturbations 

In this section, we give some elementary estimates of Golinskii- 
Simon [12] on the l p norm of C({a n }™ =0 ) - C({/3 n }™ =0 ). (For defi- 
nition and background on Schatten p-classes, see Gohberg-Krein |3*H] 
and Simon |7Tj.) 

In this section, we will use these estimates to write the Szego function 
in terms of Fredholm determinants of the CMV matrices and to discuss 
scattering theory. Further applications appear in Section 

A diagonal matrix, A, has X p norm (z~2j\ a jj\ P ) 1 ^ P ) an d ll^llp is in- 
variant under multiplication by a unitary. So if A has only a nonva- 
nishing fcth principal diagonal, A has X p norm (2\^j\ a jj+k\ p ) 1 - Since 
( a i/ P + b i/ P + c i/p) < ( a + b + c )i/p 3 i-i/ P ( by Hdlder's inequality), we 
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see for tridiagonal matrices that 



\\A-B\\ p <3 l -^(j2\^- b v\ P ) 



1/p 

(7.1) 



This lets us estimate ||£({a n }^L ) — £({fl n }^ =0 )\\ p , and similarly for 
M. Using unitarity of C and M, \\CM - C'M'\\ P < ||(£ - C')M\\ P + 
\\C'(M - M')\\ p < \\C - C'\\ p + \\M - M'\\ p . So using (a 1 ^ + &Vp) < 
2 1 - 1 / p (a + b) 1 / p , we find 

Theorem 7.1 (= Theorem 4.3.2 of [69J). Let {a n }™ =0 , {(3 n }n=o be two 
sequences in O and let p n — (1 — la^l 2 ) 1 / 2 , a n = (1 — l/^ 2 ) 1 / 2 . T/ien 

/ OO s 1/p 

l|C(K}£o) - C({A»}£ ) ||p < e 1 - 1 ^ £ K - + | Pn - a n |M 

\ n=0 - 



(7.2) 



Remark. [HHl has the constants 2 for 1 < p < 2 and 2 • 3 1 1 ^ 2p for 
2 < v < oo, but the proof there actually shows 2 1 ~ 1 / p and 2 1 ~ 1 / p 3 1_1/2p . 
This improves the constant in (|7.2j) . 



To rephrase in terms of |a n — /3 n | only, we first note that 

< (1 - i? 2 )" 1 / 2 



sup 

\z\<R 



i.(l_| z |2)l/2 



and 1 1 ^ I — I w 1 1 < I z — w I to see that 



sup 1(1 - Iz) 2 ) 1 / 2 - (1 - l^l 2 ) 1 / 2 ) < (1 - R 2 )~ 1/2 \z - w\ (7.3) 

\z\,\w\<R 

We need to note that \y/a-Vb\ < y/\a - b\ and ||a| 2 -|/3| 2 | < 2\a-p\, 
so 

|(1 - \z\ 2 ) 1 / 2 - (1 - |^| 2 ) 1/2 | < V2 |« - W | 1/2 (7.4) 

Thus, 

Theorem 7.2. Let {a n }^ =0 and {/3 n }^ =0 be ^ wo sequences in ©°° and 
let p n = (l- K| 2 ) 1/2 ; a n = (1 - |/3„| 2 ) 1/2 . T/ien 
(a) 7/sup n |a n | < i? < 1 and sup n |/3 n | < i?, i/ien 

/ OO v 1/p 

l|C({«4^o)-C({^}^ =0 )||p < ^"^[^(I-^-^Vp £ K-^I'J 

^ n=0 ' 

(7.5) 
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(b) In general, for 1 < p < oo, 

/ 00 \ i/p 

l|C({a n }~ )-C({/U£ )\\ p < Q l - l ^( £ \a n -P n \ p +2^ 2 \a n -P n \^j 

(7.6) 



n=0 



One thing made possible by CMV matrices is scattering theory be- 
cause all CMV matrices act on the same space (£ 2 ({0, 1, 2, . . . })). This 
is an important tool made possible by the CMV matrix. Thus, 

Theorem 7.3. Suppose sup n |a n | < R < 1, sup n \/3 n \ < R < 1, and 

^2 I «n ~ Pn\ <oo (7.7) 

n=0 

Then, if Pa, c {-) is the projection onto the absolutely continuous subspace 
of an operator and C = C({a n }^i ), C = C({/3 n }^L ), then 

lim C n C- n P ac (C) 

n— >±oo 

exists and is a partial isometry with range P ac (C). In particular, C \ 
P ac (C) and C f P ac (C) are unitarily equivalent. 

Remarks. 1. This follows from the fact that C — C is trace class and 
from the Kato-Birman theorem [53] . 

2. If {a n }^L corresponds to 

df) 

df* = f(8) — + dfig (7.8) 

Z7T 

and {/3 n }^L corresponds to 

d9 

dv = g(6) — + dv 5 (7.9) 
then this theorem implies that up to sets of dd- measure 0, 

{0 I f(0) ^0} = {6 I g{6) ± 0} (7.10) 
(also see Theorem 19. 3 j) . 

3. For the case f3 n = 0, this holds if only Y^=o\ a n\ 2 < °°; see jZOl 
Section 10.7]. 

Finally, following Simon |69| Section 4.2], we want to state the con- 
nection of C to the Szego function, defined for \z\ < 1 by 



D(z) = lim tfXz)- 1 (7.11) 
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which exists and is nonzero if (and only if) 



$^Kf <°o (7.12) 

3=0 

(see Section 2.4 of [HH])- We will let Co be the free CMV matrix corre- 
sponding to dfj, = rp; equivalently, a n = 0. 

Theorem 7.4. Suppose 

oo 

^K|<oo (7.13) 

n=0 

T/ien C — Cq is trace class and 

D(z)- 1 D(0) = ddQ^^ (7.14) 
If (|7.12j) holds, then C — C is Hilbert-Schmidt, and 

D{z)- l D{U) = det 2 e"^ (7.15) 

where 

oo 

Wi = «o ~~ /J a .j'%--i (^-16) 
j'=i 

Remarks. 1. Alas, (4.2.53) of has a sign error: it is e -2 ™ 1 as we 
have here, not e ZWl as appears there! 

2. By det(£j*|), we mean det((l - zC)(l - zC o y l ). Since 

(1 - zC)(l - zCo)- 1 = 1 - z(C - C )(l - ^Co)- 1 

we see that this is 1+ trace class (resp., Hilbert-Schmidt) if C — Co is 
trace class (resp., Hilbert-Schmidt). 

3. For a proof, see Section 4.2 (Theorem 4.2.14) of [69J. 

4. C is the complex conjugate of C, that is, (C)»j = (C^). 

3. det(-) is defined on operators of the form 1 + A with A trace class, 
and then det 2 on 1 + A with A Hilbert-Schmidt by 

det(l + A) = det 2 {(l + A)e~ A ) (7.17) 

When A is trace class, 

det(l + A) = det 2 (l + A)e Tr{A) (7.18) 

If dZHSD holds, -zw ! = Tr((l - «C)/(1 - zC )) and (fTT^I /(|715 jl are 
consistent by (|7.18|) . See (3D] or [7Tj for a discussion of det(-) and 
det 2 (-). 
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5. The connection for one- dimensional Schrodinger operators of the 
Jost function and Fredholm determinants goes back to Jost-Pais 46J. 
For Jacobi matrices, under the name "perturbation determinant," they 
were used by Killip-Simon p^] . 

8. Essential Spectra 

The discrete spectrum of an operator is the set of isolated points of 
finite multiplicity. The complement of the discrete spectrum in the 
spectrum is called the essential spectrum. Since a CMV matrix has a 
cyclic vector, the essential spectrum is just the set of nonisolated points 
in the support of the spectral measure, d/i, often called the derived set 
of supp(<i/z). Last-Simon [S3] have a general result for the essential 
spectrum of a CMV matrix C({a n }^L ). 

Definition. A right limit of {a n }^L is any two-sided sequence 

{(3 n }'^L- 00 in D for which there exists ri£ — > oo so lim^oo a ne+ j = fij 
for each j G Z. ?l({a„}" =0 ) is the set of right limits of {a„}^ =0 . 

Since D is compact, 1Z is nonempty. Indeed, if /3 is any limit point 
of a n , there is a right limit with fi = /3 . 

Theorem 8.1 (Last-Simon [511). F° r an y { a n}%Lo e we have 

a css (C({a n }™ =0 )) = jj a(£({AJ~o)) (8.1) 

Remarks. 1. The proof [51] uses a Weyl trial sequence argument. The 
key is that because C has finite width, for any A G cr ess (C) and e, there 
exist L, rij — > oo and </?j supported in (rij — L,rij + L) with \\ipj\\ = 1 
and 

limsup || (C - X )fj\\ < e (8.2) 

2. Right limits of Verblunsky coefficients were considered earlier by 
Golinskii-Nevai [H], motivated by earlier work on Schrodinger opera- 
tors by Last-Simon . This work was in the context of a.c. spectrum 
(see Theorem 10.9.11(ii) of [701). 

3. [511 use d the same methods to study Jacobi and Schrodinger op- 
erators. Earlier results of the form (J7.1|) for Schrodinger operators (but 
not for CMV matrices) are due to Georgescu-Iftimovici |28| . Mantoiu 
[5^1, and Rabinovich [H31- These rely on what I regard as elaborate 
machines (connected with C*-algebras or with Fredholm operators) al- 
though, no doubt, their authors regard them as very natural. 

One can combine this with Theorem 15.31 to obtain 
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Theorem 8.2. Let {aj}^L and{(3j}JL Q be two sequences of Verblunsky 
coefficients. Suppose there exist Xj G <9D so that 

(i) PjXj - ay - (8.3) 

(ii) A i+1 Aj - 1 (8.4) 

Then 

Proof. Let { r yj}°Z_ 00 be a right limit of {/3j}^L . By passing to a 
subsequence, we can suppose \ nj — > Aoo and /3 % .+fc — > jk- Since 
A^+fcA" 1 — > 1, we see that a nj+k — > AooTifc- By Theorem EI31 

o-{£{{lk\T=-oo)) = ff (£({^oo7fc}r=-oo))- rt follows (using symmetry) 
that (}8~5|) holds. □ 

Remark. This proof is from j^lj, but the result appears earlier as The- 
orem 4.3.8 in motivated by a special case of Barrios-Lopez 

Example 8.3. (This is due to Golinskii [40] : the method of proof is due 
to [Hlj. See the discussion in [S3j for earlier related results.) Suppose 
\a n \ — > 1 as n — > oo. Then <T ess (C({a n })^L ) is the set of limit points 
of {—ctj + ia.j} c fL . For any limit point has £({f3j}fL Q ) diagonal (since 

(1 — |/9j| 2 ) 1//2 = 0) with diagonal values —(3j + i(3j, and by compactness, 
any limit point of — ctj+ictj occurs as some —fiiflo- In particular, this 
(plus an extra argument) implies o" ess (£) = {A } if and only if \a n \ — > 1 
and a n+ ia n — ► — A . See [IUJ and [51] for a discussion of when cr eS s(C) 
is a finite set. □ 

It is well known (see Example 1.6.12 of t 69j and Example 11.1.4 of 
[7Uj) that if a n = a e D, then a ess (C) = A\ a \ = {z e <9B \ \&rgz\ > 
2 arcsin(|a|)}, which increases as \a\ decreases. It follows: 

Example 8.4. (Theorem 7.8 of |54j : one direction was proven in 
which motivated Theorem 7.8 of [M].) Suppose 



liminf|aj| = a (8.6) 



Then 



a ess (C(K}~ ))=A a (8.7) 
For — > 1 implies that each limit is of the form (3j = b for some 



b e D, so 



b=limits of aj 

since A\u C A a if |6| > a. □ 
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9. Spectral Consequences 

Section 4.3 of [69j describes joint work of Golinskii-Simon [12] that 
uses CMV matrices to obtain spectral results that relate properties of 
{a n }™ to the associated measures. Here, in brief, are some of their 
main results: 



Theorem 9.1 (= Theorem 4.3.5 of [69J; subsumed in Theorem 
If K - p n \ -> 0, then a css (C({a n }^ =0 )) = a ess (C({AJ^ =0 )). 

Remark. Of course, Theorem 19 . II also follows from Theorem 18.11 

Proof. By (17. 6j) and a limiting argument, C({a n }^L ) — C({/3 n }^ =0 ) is 
compact. The result follows from Weyl's theorem on the invariance of 
essential spectrum under compact perturbation. □ 

Theorem 9.2 (= Theorem 4.3.4 of [HI])- If limsup|a; n (d/i)| = 1, then 
d[i is purely singular. 



Remark. This result is called Rakhmanov's lemma, after [64]. The 
proof is motivated by earlier results for Jacobi matrices of Dombrowski 
[TSj and Simon-Spencer 



Proof. Let a n be defined by 



a. 



if a n = 
if oc n ^ 

Since limsup|a n | = 1, we can find a sequence rij — > oo, so 



3=0 



« n , - a n J 1/2 < oo 



Let 



0n 



a n if n = Uj for some j 
a n otherwise 



Then C({p n }™ =0 ) - C{{a n }™ =0 ) is trace class by (|7!B|I . By the Kato- 
Birman theorem 



^ac(C({a„}- ))=a ac (C({/3 n }- )) 

Since \a n \ = 1, C({/3 n }^L ) is a direct sum of finite matrices of size 
Tij+i — Uj, and so it has no a.c. spectrum. □ 

Theorem 9.3 (= Theorem 4.3.6 of PJ). If {a n }™ = and {(3 n }n=o are 
the Verblunsky coefficients of dfi and dv given by (|7.8j) and (|7.9|) . anc? 
(1771) ZioWs, £/ten (TTTUJ) /ioWs. 
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Proof. If limsup|a n | < 1, then limsup|/3 n | < 1, and by Theorem 17. 3| 
(|7.1U|) holds. If limsup|a n | = limsup|/3 n | = 1, then cr ac (C({a n }^L )) = 
O r ac (C({/5 n }^ =o )) = 0byTheoremEl ' □ 

10. The AGR Factorization of GGT Matrices 

This section is primarily preparatory for the next and discusses GGT 
matrix representations (for Geronimus [SI], Gragg jlHj, and Teplyaev 
78\) associated to a measure on <9B: 

Qkt({<Xn}lL)) = (<Pk,Z<Pt) (Id) 

and discussed in Section 4.1 of [69j- If ^ is nontrivial, M in (|l().ljl is oo 
and a„eD for all n. If \x is supported on exactly TV points, M = N — 1 
and «o, • • • , oin-2 £ D, oin-\ G 9B. There is an explicit calculation (see 
Proposition 1.5.9 of p3]): 

(-a e a k ^ U.m=kPm 0<k<£ 
Gm= Ipe k = £+l (10.2) 

[o fc>£ + 2 

We present a remarkable factorization of GGT matrices due to Am- 
mar, Gragg, and Reichel ifj], use it to provide a result about cosets in 
U(N)/U(N — 1), and then present an alternate proof of Theorems 19.21 
and 19 .31 using GGT rather than CMV matrices. For the special case of 
orthogonal matrices (all ct,- G (—1,1)), AGR found this factorization 
earlier 

We defined Qj(ct) before (|3.18jl as a 2 x 2 matrix acting on the span 
of Sj, Sj+i. We define 0(atj) to be this matrix viewed as an operator on 

by lj © Oj(a) © ljv-j-2- ©7v-i(ct) is the matrix ljy-i © «• 

Theorem 10.1 (AGR factorization). For any finite N x N GGT ma- 
trix, 

G{{a}^) = 6o(a ) • • • e iV _ 2 (a J v- 2 )e iV _ 1 (a i v-i) (10.3) 
For N = oo, 

G{Mn=o) = s-iim e (« ) . . . e M (« M ) (10.4) 

Remarks. 1. ()10.4|) follows from ()1U.3|) by a simple limiting argument. 
We will only prove (|1U.3|) below. 

2. We will give three proofs which illustrate slightly different aspects 
of the formula. 
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3. As explained in Killip-Nenciu [HO], the Householder algorithm 
lets one write any unitary as a product of iV — 1 reflections; in many 
ways, the representation (|1U.4|) is more useful. 

First Proof. By a direct calculation using (|10.2|) . 

£(K}n=o) = ©o(a )[lixi © G({a n+1 }^- 2 )} (10.5) 
(jl0.3|) follows by induction. □ 

Second Proof (that o/AGR 6J). We will prove first that any unitary 
(upper) Hessenberg matrix H (i.e., Hm = if k > i + 1) with positive 
subdiagonal (i.e., Hi+ij > for all £) has the form ()10.3[) for suitable 
ai, . . . , otN-2 G D and qjjv-i G <3D. The first column of H has the 
form (ao, po) 0, . . . , 0)* for some ao G D. Then 9o(ao) _1 -f^ is of the 
form l lxl ®H^ l \ where H^ 1 ' is a unitary (iV — 1) x (iV — 1) Hessenberg 
matrix with positive subdiagonal. By induction, H has the form (jl0.3|) . 
One proves that {ct n }n=o are the Verblunsky coefficients of the GGT 
matrix, either by using ()10.2|) or by deriving recursion relations. □ 

For the third proof, we need a lemma that is an expression of Szego 
recursion. 

Lemma 10.2. We have that 

((p*,Z(pj) = OLj (10.6) 
{(Pj+i, z(Pj) = Pi (10.7) 
(<Pi+i,<P* j +i) = -a j (10.8) 

Remark. This says that a certain change of basis on a two-dimensional 
space is 0(oj). 

Proof. <fj + i J_ (p* since deg(<p*) < j. Moreover, by ()2.5j) and ()2.12j) . 

zipj = pjVj+i + aj(p* 

<P*+1 = -OLj<p j+1 + Pjifj 

from which ()10.6|) — f)10.9|) are immediate. □ 

Third Proof of Theorem MO.il This is an analog of the proof of CM. 
factorization in Section EJ There C is the matrix of overlap of the 
orthonormal bases {zxi}^ and {Xi}iLo- The CM. factorization comes 
from inserting the basis {za^}^ . Here we have the bases 

e (0) = (% (10.10) 

e (A ° = (<Po, ■ ■ -,<Pn-i) 
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and Q is an overlap matrix. We introduce N — 1 intermediate bases: 

, Z<f N -j-l, ¥*N-j' ^N-j+l, ■ ■ ■ , (PN-l) 

where is given by ()10.10|) since (p$ — 1 = (p . 
Thus 

= V (p {N) P ( N -V\ I P i N -3) p {N-j-l)\ / p (l) p (0)\ 
/ , , \°fc ' °mi / • • • \°mj ' mj+i / • • • \°mjv_i' °£ / 

mi...mjv-i 

(10.11) 

is a product of N matrices. N — 1 have a change from zyjj, (fj + i to y?*, ^ 
whose overlap matrix, by l)l().6J) - (|10.9)l . is 0(oj) and the extreme right 
has a change from z(fN-i to f^-ii which is 0(ajv-i) since in L 2 (<9D), 

z^ N _i - cx N ^ X ip* N _ x = 

Thus, (gnu is (nnn. □ 

As a first application, we want to show that each finite unitary has 
an CM factorization without recourse to orthogonal polynomials. By 
taking limits, one obtains an CM factorization in general. This cal- 
culation fleshes out an argument given by AGR [S] in the orthogonal 
case by using induction to make the proof more transparent: 

Theorem 10.3. Let U be a unitary matrix on C N with (1, 0, . . . 0)* as 

cyclic vector. Then there exists a unitary V on C N with V(l, 0, ... 0)* = 
(1, 0, . . . , 0)* so that VUV^ 1 has an CM factorization. 

Remark. By CM factorization, we mean C = O(cto) © ©(a^) © ' ' ' an d 

M = lixi © 0(«i) © 0(013) © ' ' ' j with a 0(«Ar_i) at the end of C if 
N is odd and of M is N is even. 

Proof. We use induction on iV. N = 1, which says U = (0(«o)(l), 
is trivial. By the GGT representation and AGR factorization, we can 
find W (with W(l, 0, . . . , 0) 4 = (1, 0, . . . , 0)*) so 

WUW- 1 = o (a o )0i(oi) • • • Q N -i(a N -i) 

Let 

U 1 = e (a 1 )...e N -2(a N -x) (10.12) 



e (1) = (z(p , ■ ■ ■ 
e (2) = {z(p , ■ ■ ■ 

e {3) = (z(p , ■ ■ ■ 
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on C^ -1 . By induction and adding li x i © • • • everywhere, we can find 
£1, Mi, and V\ so 

(1 © VJWUW- 1 ^ + VxY 1 = 9 (a )[l © A][l © Mi] (10.13) 
Define 

(note F maps (1 ... 0)* to itself). We have 

VUV- 1 = {[1 © Mi]0 o (a o )}{[l © £i]} 
which precisely has the form CM. □ 

As a second application, we want to provide an explicit map that 
will be critical in the next section. Fix S G C n and U G U(n), the 
n x n unitary matrices. Let U(n — 1) = {U G U{n) \ US = S }. The 
symbol U(n — 1) is accurate since each such U defines and is defined by 
a unitary on {So} 1 - = C n_1 . Let SC 2 ^ 1 = U(n)/U(n - 1), the group 
theoretic quotient. By mapping 

tt: U G U(n) -> C/5 (10.14) 

we see that §C 2n_1 = {z£ C n | |z| = 1}, the sphere of real dimension 
2n — 1. Here is the result we will need: 

Theorem 10.4. There exists continuous maps g\ and g 2 defined on 
{z G SC 2 "- 1 | z ^ S } with g x mapping to U(n) and g 2 to SC 2 "" 3 = 
{z G SC 2 "" 1 | (5 , z) = 0} so 
(i) 

n[ gi (z)]=z (10.15) 

(ii) V(17) = g^U))' 1 !/ G U(n - 1) for all U (£U(n- 1) 

(iii) If Sq is cyclic for U with Verblunsky coefficients aij(U,6o), then 
g 2 {<K{U)) is cyclic for V{U) \£. n ~ 1 and 

a j (V(U),g 2 (7r(U))) = a j+1 (U,S ) (10.16) 

(iv) 

(S , US ) = a (U, So) (10.17) 

if So is cyclic for U. 



Proof. If z 7^ So, a(z) = (S , z) G D and so 

z - (S ,z)S 



92(z) = 



\z - <<5 , ^><5o || 

is well defined and in SC 2 ™ -3 . In particular, if p(z) = (1 — a(z) 2 ) 1 / 2 = 
\\z — {S ,z)So\\, we have 



z=p{z)g 2 {z)+a(z)So (10.18) 
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Define gi(z) by 

{w if w±5 ,g 2 (z) 

z if w = S 

-a(z)g 2 {z) + p{z)5 if w = g 2 (z) 

and otherwise linear. Then gi(z) is unitary since Q(a(z)) is unitary, 
(i) is obvious from gi(z)So = z. (ii) is a restatement of (i). (iii) follows 
from the fact that g 2 (z) corresponds to 8\ in a 8j = Xj( z ) basis and 
the AGR factorization, (iv) is a consequence of z(p — a y?o = V^i? so 
(tp , zipo) = a (fo, fo) = ato- □ 

We want to close this section by noting that the AGR factoriza- 
tion implies an estimate on the GGT matrices that is not obvious 
from (jl0.2|) . Indeed, in Section 4.1 of [69 , an unnecessary condition, 
liminf|a n | > 0, is made because the estimate below is not obvious. 

In essence, the AGR factorization plays the role for estimates of 
GGT matrices that the CM. factorization does for CMV matrices. In 
some ways, it is more critical because CMV matrices are five-diagonal 
with matrix elements which are quadratic in a and p, so one can easily 
get estimates like ()7.2|) (but with a worse constant) without using the 
CA4 factorization. Since GGT matrices are not finite width and have 
matrix elements that are products of arbitrary orders, direct estimates 
from (jl(J.2J) are much harder. 

Theorem 10.5. Let {a n }^L and {/3 n }^l be two sequences in D°° and 
leta n = (1 - \p n \ 2 ) 1/2 - Then 

oo 

\\g({a n }™ =0 ) - 0({&}£=o)||i < 2 E (K - &I + K - Ail) (10.19) 

?1=0 

Proof. By fll().4j) and standard trace class techniques [H] , we need only 
prove for finite sequences that 

N 

\\e (a ) ■ ■ ■ & N (a N )-e ((3o) . . . ejv(/3iv)||i < 2^2(\a n -/3 n \+\<T n -p n \) 

n=0 

(10.20) 

Since ||G_j-(a 3 -) — ©j(/%)||i < 2(|aj — + \o-j — pj\), and ||0(a)|| = 1, 
writing the difference of products as a telescoping sum yields ()10.20|) . 

□ 

Notice also that AGR factorization shows that if |oj-| = 1, Q decou- 
ples. This fact and ()10.19|) provide an alternate proof of Rakhmanov's 
lemma ( Theorem 19. 2j) by the same decoupling argument, but using Q 
in place of C. If X]^Lol a «l 2 = 00 ' one a ^ so S e ^ s a P ro °f °f Theorem 19.31 
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using Q in place of C and (|l().19j) . If X]^Lol a «l 2 < 00 > one mus ^ use the 
extended GGT matrix, J 7 , of Section 4.1 of [UH] (see also Const ant inescu 
[To]). It is easy to prove that if X^l a "| 2 < 00 anc ^ — Pn\ < °°! 

then jF({a„}^ ) — jF({/3 n }^L ) is trace class since the difference of ^-"'s 
differs from the difference of (?'s by a rank one operator, which is always 
trace class! 



11. CUE, Haar Measure, and the Killip-Nenciu Theorem 
In jSD], Killip and Nenciu proved the following result: 

Theorem 11.1. Let dfi be a normalized Haar measure on U(n), the 
n x n unitary matrices. Then, for a.e. U, So = (1, 0, ... , 0)* is cyclic, 
and the measure induced on D n_1 x <9ID> by U — > aj(Uo, 5q) is the product 
measure: 



n 

j=0 



11 1 ' (1 n/ J )" -d-<\ : 



71 



1 d9(a n -i) 
J 2tt 



11.1) 



where 9(a n ^i) is defined by 

a n _! = e^ ™- 1 ) (11.2) 

and d 2 a is a two-dimensional Lebesgue measure on D. 

Remark. By the "induced measure," we mean the measure dv on D n_1 x 
83 given by u(B) = ^(A -1 [£?]), where A{U) = (a^U), a n ^{U)). 

This is really a result about Verblunsky coefficients, not CMV ma- 
trices, and both their proof and ours use the GGT, not the CMV, 
matrices. We provide this here because, first, Killip-Nenciu proved 
this result to provide a five-diagonal model for CUE (see below), and 
because the result was proven as part of the ferment stirred up by the 
CMV discovery. In this section, we will provide a partially new proof 
of Theorem 111.11 that is perhaps more natural from a group theoretic 
point of view, and then describe and sketch their somewhat shorter 
argument! 

To understand where the factors in (jll.ljl come from: 

Lemma 11.2. Let d/xgc 2 ™- 1 be the measure on the 2n — l real dimension 
manifold {z G C n | \z\ = 1}, which is normalized and invariant under 

rotations. Map SC 2 ™" 1 D by z 1— > z\ , the first component, and let 
dv be the measure on D given by v{B) = /z§ C 2n-i((5 -1 [£>]). Then 

du(w) = - — - (1 - \w\ 2 ) n - 2 d 2 w (11.3) 

7T 
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Proof. Since d 2 w = \d6 d\w\ 2 , 

du(w) = ^ - l) - [\l-x) n - 2 dx = l 

2 J Q 

so dv is normalized. Thus, we will not worry about constants. Using 
x\ + 1x2, x% + 1x4, ... for the n complex variables, the measure 5(1 — 
|x| 2 )iixi . . . dx 2n is 

dX\ . . . dX2n—\ 

2(1 -e?:A 2 ) 1/2 

Integrating out x%, . . . ,X2n-i for fixed 21,22 with p = (1 — |xi| 2 — 
\x 2 \ 



I 2 ) 1 / 2 , the measure is 

1 f d 2n ~ 3 y 



2 4l<(i-p 2) v 2 (l-p 2 -y 2 y/ 2 
Scaling x = y/(l — p 2 ) 1 ^ 2 , we find 



2^ " 7 ./,„<! (1-^)1/2 

so the measure is C(l — w 2 ) n ~ 2 d 2 w, proving (jlOjl . □ 

Theorem II 1 .41 below must be well known to experts on homogeneous 
spaces. 

Lemma 11.3. Let dv\ 1 dv2 be two probability measures on compact 
spaces X and Y , and let dv be a probability measure on X xY. Suppose 

(i) -K*(dv) = dv\, that is, ifiri(x,y) = x, then v\(B) = ^(vrf 1 [£>]). 

(ii) For any continuous f on X, J x f dv = C / dv 2 , that is, 

f(x)g(y)dv = C f [ g(y)dv2 (11.4) 



for all continuous g on Y. 
Then dv = dv\ ® dv2. 

Proof. Taking g = 1 in (jll.4j) . 

c f= f(x)dv= I f(x)dvi(x) 



by (i). Thus, 

f(x)g(y)dv= [ I f(x)dv 1 (x) )( / g(y)dv 2 (y) 



so dv = dv\ <g> dv 2 integrated on product functions which are total in 
C(X x Y). □ 
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Theorem 11.4. Let G be a compact group and H a closed subgroup. 
Let dv G ,dvn be normalized Haar measures and n\ G — > G/H. Let 
dvQ/H be the measure induced by dv G on G/H, that is, 

vg/h{B) = u G (n- 1 [B]) 

Let O be an open set in G/H and f : O — > G a continuous cross- 
section, that is, 7T [/(#)] = x for all x G O. Coordinatize 7r _1 [0] by 
O x H via 

(x,h)^f(x)h (11.5) 

Then, on 7r _1 [0], 

dv G (x, h) = dv G / H (x) dv H (h) (H-6) 

Proof. The existence of a cross-section implies that under the coordi- 
nates (fTOjl . n-^O] = O x H. Clearly, ir$(dp: G ) = dfi G/H \ O, by 
construction of dp,a/H- On the other hand, J f d^a/H is a measure on 
H invariant under right multiplication by any h G H, so this is Cfdfiu- 
Therefore, Lemm dT 1.31 applies and (jll.6|) holds. □ 

Proof of Theorem Ml.il We use induction on n. n — 1, that is, that for 
U(l), [/ = (e ie °) has Haar measure is immediate. 

Note that U G U(ra) has Sq as a cyclic vector if and only if U has sim- 
ple spectrum, and for each eigenvector ipk of U, we have 5q) 7^ 0. 
As is well known, {{7 | Z7 has a degenerate eigenvalue} has codimen- 
sion 3 and so zero Haar measure. Similarly, (ip^, So) = on a set of 
codimension 2 and so zero Haar measure. Thus, C n = {E | 5q is cyclic 
for U} has full Haar measure. 

Let O = {r] G SC 2 ™" 1 | r] ^ 5 }. Then /(a;) = g^z) given in 
Theorem I1U.4I is a cross-section, and so d//u( n ) = g^sc 2 ™- 1 ® ^u(n-i) 
by Theorem 111.41 

By Theorem HIOl ^(^nD is («, V(C/)) and V(U) has Verblunsky 
coefficients {aj + i(U)}™~ . Thus, by induction, dfj,^-!) 011 these a's is 
the product ()11.6|) without the «o factor. 

By ()10.18|) and Lemma 111.31 the «o distribution generated by 
d/zsc 2 ™- 1 is the a factor in ([ll.ip . □ 

The proof in jHU] differs in two ways: First, in place of the AGR fac- 
torization, Killip-Nenciu use a (Householder) factorization as a phase 
factor times a product of reflections. Instead of using induction on sym- 
metric spaces as we do, they use an alternate that would work with the 
AGR factorization also. Starting with ipo = So, we let ipo = Utp. There 
is a unique vector, ifi (what we called g2ij{U)) in Theorem I10.4|) . in 
the span of ip and ipo, so that (ipo, fi) > and (ipi, So) = 0. tp\ is cyclic 
for V(U) and so, by induction, we obtain if) , ipi, . . . , ipn-i an ON basis 
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with \{Sj,ipj)\ < 1. It is not hard to see that, via the AGR factoriza- 
tion, this sets up a one-one map of ON basis with | (Sj, tpj) \ < 1 and C/'s 
with Sq cyclic for U. Haar measure induces on the ^'sa measure as fol- 
lows: ipQ is uniformly distributed on SC 2n_1 ; ipi uniformly on the copy 
of §C 2n ~ 3 of unit vectors orthogonal to ipx] ip2 uniformly on §C 2ra_5 , 
etc. Since (Sj,ipj) = &j, we obtain the measure ([ll.ip . 

Since det(G J -(a J -)) = —1, det(Ojv-i(ajv-i)) = cijv-i, we see 

det(S(K}n=o) = (-lf-Wx 

Thus, §U(n) = {U G U(n) | det(U)} is precisely these U with (%n-i = 
(— l)^ -1 . The same inductive argument thus proves: 

Theorem 11.5. Let dp, be normalized Haar measure on SU(n). Then 
for a.e. U, So — (1, 0, . . . , 0)' is cyclic and the measure induced on D n_1 
by U — > Qj(U, S ) (with a n _x(U, S ) = (— l) n_1 ) is the product measure 
given by (jll.lj) with the final d9 term dropped. 

SO[n] is the n x n real unitary matrices (i.e., orthogonal matrices). 
If So is cyclic, they have Verblunsky coefficients which are easily seen 
to lie in (—1,1). Conversely, it is easy to see that if ctj G (—1,1) for 
j = 0, ... ,n — 2, there is an orthogonal matrix with those <x,'s. A 
similar analysis lets us compute the distribution on (—1, l) n_1 induced 
by Haar measure on SO[n]. We need only replace Lemma fl 1.31 by 

Lemma 11.6. Let dn n ^i be the measure on the n — 1-dimensional unit 
sphere in M n . The induced measure on x\ is 

r(f)(l- | a ; 1 |2)(rx-3)/2 ^ 

Proof. That the measure is C(l— \xi\ 2 )^ n ~ 3 ^ 2 ^ dx% follows from the same 
calculation as in Lemma fl 1.31 The normalization is the inverse of the 
beta function 2 2 ~T(n — l)/r( !i ^-) which, as noted by [5U] (there is a 
( ) _1 missing on the leftmost term in their (3.14)), can be written, using 
the duplication formula for beta functions, as Y / ^/nY (J^) . □ 

We thus have 

Theorem 11.7 (jEI3j)- The measure on (— 1, l) n_1 induced by Haar 
measure on SO[n] mapped to the real Verblunsky coefficients is 
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The CUE eigenvalue distribution [HI EU ED is the one for U e 
U[n] induced by Haar measure. Weyl's integration formula (see, e.g., 
says that if A 1; . . . , \ n with A, = e j are the eigenvalues, this is 



C rTi<jl^i — <\j 1 2 YYj=i ^f- Theorem 1 1 1 . 1 1 savs that CMV matrices with 
distribution of a's given by (jll.lj) has the same distribution, and so 
gives a model for CUE by five-diagonal matrices. [3U| find a similar 
model for the "/3-distributions" given by Cp Y\_ i= A \ — see also 

Forrester-Rains 1251 . 



12. CMV and the AL Flow 

One of the great discoveries of the 1970's (0 EH EEl E23 E3 
and dozens of other papers) is that lurking within one- dimensional 
Schrodinger operators and Jacobi matrices is a completely integrable 
system (resp., KdV and Toda flows), natural "invariant" tori, and a 
natural symplectic structure in which the Schrodinger operator or Ja- 
cobi matrix is the dynamical half of a Lax pair. 

Such structures occur also for Verblunsky coefficients, and the dy- 
namical half of the Lax pair is the CMV matrix. While the CMV part 
obviously requires CMV matrices, the other parts do not, and it is 
surprising that it was only in 2003-04 that they were found. We will 
settle here for describing the two most basic structures, leaving further 
results to mentioning the followup papers: Geronimo-Gesztesy-Holden 
[3Hj . Gesztesy-Zinchenko (SHI, Killip-Nenciu [3Tj, Li [33], and Nenciu 
M- 

On D, introduce the symplectic form given by the Poisson bracket 
(where, as usual, and J| are ~[J^ =F ^]), 



{f,g} = *P 2 



df_ dg_ 

dz dz 



df_ dg_ 

dz dz 



;i2.i) 



The p 1 is natural as we will see below. Extend this to W (coordinatized 
by (ao, • • • , Op_i) by 

p-i 



ff | = zV 2 d ^ — d ^ — 



12.2) 



Because of the p 2 



p-i 
Un 

j=0 



P%9 



j=0 ^ j=0 



and functions of p • • ■ Pp-i generate simultaneous rotations of all 
phases. Nenciu-Simon [H2I proved the following: 
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Theorem 12.1 ([H2])- Let p be even and let A(z,{aj}^Z ) be the dis- 
criminants (see (|3.20p ) for the periodic sequence with ctj+kp = ctj for 
j = 0, . . . ,p — 1; k = 0, 1, 2, Then, with respect to the symplectic 

form firm . 

{A(w),A(z)} = (12.3) 

/or allw,ze C\{0}. 

iVote: See [SHI EI] for a discussion of symplectic forms on unitary 
matrices. 

Since A(l/w) = A(w), and the leading coefficient is real, A(z) has 
p real coefficients, that is, A(z) = Yl P j=- p /2 a i z ^ w ifh a -j = then 
dp/2, Re dp/2-i, Im dp/2-i, . . . , Re ai, Im di, do are the p real functions of 
{aj}^Zo which Poisson commute. They are independent at a.e. points 
(in a) and define invariant tori. Each one generates flows that are 
completely integrable. The simplest is 

-ictj = p){a j+ i + aj-i) (12.4) 

which has been known as a completely integrable system for a long time 
under the name "defocusing Ablowitz-Ladik flow" (after [H 121 El)- 
Nenciu has proven a beautiful result: 

Theorem 12.2 ([SHI EDI HH])- The flows generated by the coefficients 
of A can be put into Lax pair form with the dynamical element being 
the Floquet CMV matrix. 

For details as well as extensions to some infinite CMV matrices, see 
the references above. 

The flow generated by rij=o p] rea h zes f ne a j ~^ invariance of 
the isospectral tori. The flow ()12.4|) is generated by Re(di). The Im(dfc) 
generate flows that preserve the set of {ctj}^zl where all aj are real (as 
a set, not pointwise). The simplest of these, generated by Im(di), is 

a n = pl(a n+ i - a„_i) (12.5) 

called the Schur flow. Via the Geronimus relations of the next section, 
these generate a flow on Jacobi parameters that is essentially the Toda 
flow. For further discussion, see jU 123 EH] • 

13. CMV Matrices and the Geronimus Relations 

In a celebrated paper, Szego [TB] found a connection between or- 
thogonal polynomials for measures on [—2, 2] (he had [—1, 1]; I use the 
scaling common in the Schrodinger operator community) and OPUC. 
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Given a measure on [—2,2], one defines the unique measure on 
<90 which is invariant under z — > z and obeys 

g{x)d 1 {x) = j g{2cosB)di{e) (13.1) 

What Szego showed is that the orthonormal polynomials p n for d*y and 
the OPUC for ip n for d^ are related by 

Pn (z + i J = C n 2r n (</? 2n (z) + <^ n (z)) (13.2) 

The normalization constants C n (see (13.1.14) in [70]) -> 1 as n -> 
oo if ^^Lol a "| 2 < 00 • Motivated by this, Geronimus [33] found a 
relation between the Verblunsky coefficients, a n , for <i£ and the Jacobi 
parameters {a n }'^L l) {6 n }^ =1 for d^y (see Theorem 13.1.7 of [70]): 

a n+l = (1 - «2n-l)(l - «2n)(l + a 2n+l) (13.3a) 
b n+1 = (1 - tt 2 n-l)tt2n ~ (1 + a2n-l)«2n-2 (13.3b) 

In jnUJ, Killip-Nenciu found a direct proof of (|13.3|) by finding a 
beautiful relation between CMV and some Jacobi matrices. We will 
sketch the idea, leaving the detailed calculations to [50] or the peda- 
gogic presentation in Section 13.2 of [70] . 

A measure is invariant under z — > z if and only if all {a„}™ =0 are 
real. 0(a) with a real is selfadjoint and unitary with determinant — 1, 
hence eigenvalues ±1, that is, a reflection on C 2 . Thus, 

a„ = 6t v all n =>- Ai 2 = C 2 = 1 



Since Xn(z) = x n (l/z), we see that if \i is invariant and (Mf)(z) = 
f{z), then 

(Xj,M X i) = Mjt 

C + C* is selfadjoint and maps {/ | Aif = /} to itself. Let us see in 
a natural basis that its restriction to this invariant subspace is a Jacobi 
matrix. 

If a is real and 

q, \ _ J_ / y/l — a + a N 

then 

5(a)9(a)5(a)" 1 = (13.4) 

Define 

5 = li x i © 5(ai) © S(a 3 ) © ••• 
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so 



SMS 



-i 



n 



\ 



\ '•• / 

and define 

B = SCS- 1 

Then 

5(C + C -1 )5 _1 = TZB + Bn 
which commutes with 1Z. 

B is seven-diagonal as a product of three tridiagonal matrices. More- 
over, since B commutes with TZ, its odd-even matrix elements vanish. 
It follows that 

TIB + BK = J e © J 

where J e acts on {5 2n }^i and J Q on {(5 2n +i}^o; an d eacn ^ s a J ac °bi 
matrix. A calculation shows that the Jacobi parameters of J e are given 
by ()13.3|) and that the spectral measures are related by (|13.1|) . One 
can also analyze J Q which is related to another mapping of Szego jZH] 
and one gets two more Jacobi matrices by looking at C + C~ l restricted 
to the spaces where £ = 1 or £ = — 1. 
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